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ABSTRACT 

Ribosome assembly is a process fundamental for all 
cellular activities. The efficiency and accuracy of the 
subunit assembly are tightly regulated and closely 
monitored. In the present work, we characterized, 
both compositionally and structurally, a set of 
in vivo 50S subunit precursors (45S), isolated from 
a mutant bacterial strain. Our qualitative mass spec- 
trometry data indicate that L28, L16, L33, L36 and 
L35 are dramatically underrepresented in the 45S 
particles. This protein spectrum shows interesting 
similarity to many qualitatively analyzed 50S precur- 
sors from different genetic background, indicating 
the presence of global rate-limiting steps in the 
late-stage assembly of 50S subunit. Our structural 
data reveal two major intermediate states for the 
45S particles. Consistently, both states severally 
lack those proteins, but they also differ in the sta- 
bility of the functional centers of the 50S subunit, 
demonstrating that they are translationally 
inactive. Detailed analysis indicates that the orien- 
tation of H38 accounts for the global conformational 
differences in these intermediate structures, and 
suggests that the reorientation of H38 to its native 
position is rate-limiting during the late-stage 
assembly. Especially, H38 plays an essential role in 
stabilizing the central protuberance, through the 
interaction with the 5S rRNA, and the correctly 
orientated H38 is likely a prerequisite for further 
maturation of the 50S subunit. 

INTRODUCTION 

As the largest ribonucleoprotein complex in the cell, the 
ribosome is responsible for the biosynthesis of proteins in 
all living organisms. Ribosome biogenesis accounts for a 
majority of cellular RNA synthesis activities, and is tightly 



regulated and coupled with growth control pathways (1). 
Despite the complex composition of ribosomal subunits, 
earlier pioneering works led by Nomura (2) and Nierhaus 
(3) have demonstrated that active subunits could be 
reconstituted in vitro from individual rRNAs and 
proteins. Through numerous measurements of binding 
interdependences between different proteins, they have 
constructed the general in vitro assembly maps for the 
30S and 50S subunits (4,5). Their data reveal that riboso- 
mal proteins bind in different orders, displaying both hier- 
archical and parallel manners. 

In the cell, the subunit assembly is highly efficient, 
facilitated by a variety of cofactors with diverse functions, 
including ribonucleases, rRNA helicases and chaperones, 
rRNA and ribosomal protein modification enzymes and 
RNA-binding GTPases (1,6). Disruption of cofactors by 
genetic deletion or mutation leads to growth defects and 
accumulation of subunit precursors. GTPases constitute a 
large part of assembly cofactors (7,8). Although the 
precise roles of these GTPases are still not clear, 
emerging experimental data show that most of them act 
at relatively late stages of the subunit assembly, and par- 
ticularly, some of them might function to couple the 
assembly to other cellular processes, or intervene at differ- 
ent checkpoints to ensure the quality of subunit produc- 
tion (7,9). 

Several bacterial GTPases, such as YlqF, YsxC, EngA, 
ObgE, Era, RsgA and YqeH, have been implicated in the 
assembly of the 50S or 30S subunit (7). Among them, 
YlqF (also known as RbgA) was first identified to be in- 
dispensable for growth in Bacillus subtilis (10,11). YlqF 
belongs to an unusual GTPase family, featuring a circu- 
larly permuted GTPase domain (12). YlqF homologs are 
widely present in gram-positive bacteria, archaea and all 
eukaryotes, and also found in a few gram-negative 
bacteria (but not Escherichia coli) (13,14). Depletion of 
YlqF in B. subtilis confers a slow growth phenotype, 
and induces an abnormality of the in vivo ribosome 
profile, with an increased level of 45S precursors and 
a decreased level of 70S ribosomes (15-17). And the 
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premature 45S particles accumulated in YlqF-depleted 
cells unambiguously lack several proteins, including LI 6, 
L27 and L36 (15,16,18), indicating that these 45S particles 
are a specific category of in vivo assembly intermediates. 

To understand the structural change of the 23 S rRNA 
during conformational maturation and to investigate 
possible quality control mechanism on the 50S subunit 
assembly, in the present work, we combined quantitative 
mass spectrometry (QMS) and cryo-electron microscopy 
(cryo-EM) to characterize the premature 45S particles 
isolated from an YlqF-deficient B. subtilis strain (16). 
The QMS data indicate that a few proteins, such as 
L28, LI 6, L33, L36 and L35, are dramatically 
underrepresented in the 45S particles. This protein 
spectrum shows interesting similarity to a number of 
qualitatively analyzed 50S precursors from E. coli, 
indicating the presence of global rate-limiting steps in 
the late-stage assembly of bacterial 50S subunit. More im- 
portantly, the structural analysis reveals several major 
conformations of the 23S rRNA in the 45S particles, dif- 
fering in the stability of the functional centers of the 50S 
subunit and the orientation of a long rRNA helix H38. 
These structural data indicate that the 45S particles are 
defective in both subunit association and tRNA binding. 
Interestingly, further comparison of two intermediate 45S 
conformations shows that the reorientation of H38 from 
its intermediate non-native position to the native-like con- 
formation is likely a rate-limiting step, which would lead 
to a global stabilization of the whole central protuberance 
(CP) of the 50S subunit. Together with published data, 
our results suggest that the mutual stabilization between 
the CP and H38 is crucial for the further maturation of the 
23S rRNA in the functional centers, and assembly factors, 
such as YlqF, come into play to facilitate this transition to 
accelerate the late-stage 50S subunit maturation. 

MATERIALS AND METHODS 

Purification of mature and premature 50S ribosomal 
subunits 

Mature 50S subunits were purified from wild-type 
B. subtilis 168 strain, and premature 45S particles from a 
previously constructed YM01 strain (16), in which the 
genomic ylqF gene was placed behind the isopropyl [3-D- 
1-thiogalactopyranoside (IPTG)-inducible spac promoter. 
Both strains were grown in Luria-Bertani medium at 37°C 
without IPTG. Cells were collected through centrifuga- 
tion, when OD600 reached 0.4-0.6, to avoid the formation 
of spores, in which most of the ribosomes are in the form 
of 100S. Cell pellets were resuspended with opening buffer 
[20mM Tris-HCl, pH 7.5, 100 mM NH 4 C1, 10.5 mM 
Mg(OAc) 2 , 0.5 mM EDTA, 1 mM TCEP (Tris(2- 
carboxyethyl)phosphine)], and disrupted by sonication. 
Cell debris was removed by centrifugation at 20000rpm 
(Avanti J-26 XP, Beckman Coulter) for 40min. The super- 
natant was applied onto 5 ml sucrose cushion buffer [20 
mM Tris-HCl, pH 7.5, 500 mM NH 4 C1, 10.5 mM 
Mg(OAc) 2 , 0.5 mM EDTA, 1.1 M sucrose, 1 mM TCEP] 
and centrifuged using a 70Ti rotor (Beckman Coulter) at 
28 000rpm for 19 h. The pellets were washed with 2.0 ml 



washing buffer [20 mM Tris-HCl, pH 7.5, 500 mM 
NH 4 C1, 10.5 mM Mg(OAc) 2 , 0.5 mM EDTA, 1 mM 
TCEP] and subjected to an additional centrifugation 
using 5 ml sucrose cushion. The final ribosomal pellets 
were washed and dissolved in buffer I [20 mM Tris- 
HCl, pH 7.5, lOOmM NH 4 C1, 15mM Mg(OAc) 2 , 1 mM 
TCEP]. 

For mature 50S subunits, the crude ribosomal mixture 
was subjected to a 10^10% sucrose density gradient [15 
mM Mg(OAc) 2 ] centrifugation using a SW32 rotor 
(Beckman Coulter) at 30000rpm for 8h. Fractions cor- 
responding to the 70S peak were collected and applied 
onto a 10-40% sucrose gradient [2 mM Mg(OAc) 2 ], to 
separate the 30S and 50S subunits. Fractions correspond- 
ing to the 50S peak were pooled and diluted in buffer II 
[20 mM Tris-HCl, pH 7.5, 100 mM NH 4 C1, 10 mM 
Mg(OAc) 2 , 1 mM TCEP]. 

For premature 45S particles, the crude ribosomal 
mixture was incubated with excessive mature 30S 
subunits at 37°C for lOmin, and subjected to a 10^10% 
sucrose gradient [15 mM Mg(OAc) 2 ] in a SW32 rotor 
(Beckman Coulter) centrifuged at 30000rpm for 8h. 
Fractions corresponding to the 45S peak were pooled 
and diluted in buffer II. Two independent biological rep- 
licates (batch 1 and batch 2) of the 45S samples were simi- 
larly obtained and subjected to further analysis. 

Quantitative mass spectrometry 

To quantify the relative ratios of ribosomal proteins in the 
two samples, the tandem mass tags (TMTs) labeling 
method was used (19). The detailed procedures were 
described previously (20). For normalization purpose, 
samples with same A260 absorption value were resolved 
by protein gel. Target proteins were recovered and 
digested using trypsin (Promega). Peptides were labeled 
with TMT reagents (Thermo, Pierce Biotechnology). 
TMT 126 and 127 were used for the 45S (batch 1) and 
50S samples, respectively. Labeled peptides were subjected 
to Liquid chromatography-tandem mass spectrometry 
analysis. Ratios of two or more tryptic peptides from 
the same protein were used to calculate the means and 
standard deviations (Supplementary Table SI). 

Cryo-EM and image processing 

Cryo-grids were prepared as previously described (20). All 
images were collected using an FEI Titan Krios at 300 kV 
under low-dose conditions (20e~/A 2 ) on an FEI Eagle 
4K x 4K CCD camera. Preprocessing of micrographs 
and particle selection were performed with SPIDER 
(21). For the mature 50S sample, 4841 micrographs were 
collected at a magnification of 75 000 x . A total of 152 540 
particles were obtained and subjected to a standard refer- 
ence-matching refinement using SPIDER (21). The final 
resolution is 7.6 A (FSC 0.5 criterion) for the mature 50S 
map. For the premature 45S sample, 2019 (batch 1) and 
2925 (batch 2) micrographs were collected at a magnifica- 
tion of 59 000 x. Particles of the 45S samples were sub- 
jected to reference-free 2D and 3D classifications using 
RELION (22). Different parameters and class numbers 
have been tested. On the basis of the classification 
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results, 30S, 70S and junk particles were discarded 
(Supplementary Figure SI). For the batch 1 data, 3D 
classification of final 91 323 particles rendered three 
conformational states (Supplementary Table S2 and 
Supplementary Figure S2). For the batch 2 data, final 
201 876 particles were subjected to 3D classification, and 
four conformational states of the 45S particles were 
obtained (Supplementary Table S2). These intermediate 
structures were further grouped into two major states 
(I and II), according to their map features. All the 
density maps were sharpened using a B-factor approach 
(23). The statistics of image processing and summary of 
the map features are presented in Supplementary Table 
S2. According to the 3D classification results, ~19% of 
the particles from the batch 2 data are mature 50S par- 
ticles, but virtually no mature 50S particles were found 
in the batch 1 data (Supplementary Table S2). This is 
probably due to experimental variations during the 
sucrose gradient fractionation. 

Atomic model of the B. subtilis 50S subunit and 
flexible fitting 

Models of the 23S and 5S rRNAs were built using the 
crystal structures of the 50S subunits from E. coli (PDB 
ID: 2AW4) (24) and Thermus thermophilus (PDB ID: 
2J01) (25). Sequence alignments of the 23S and 5S 
rRNAs were constructed with the software S2S 
(Sequence to Structure) (26). ModeRNA (27) was used 
for building RNA models. Models of ribosomal 
proteins, LI, L3, L4, L6, L13, L14, L15, L17, L19, L20, 
L21, L22, L23, L24, L27, L29, L30, L31, L32, L33, L34, 
L35 and L36 were downloaded from the SWISS-MODEL 
Repository (28). The others, including L2, L5, Lll, L16, 
LI 8 and L28 were modeled using MODELLER (29) with 
crystal structures of E. coli (24) and T. thermophilus (25) 
50S subunits as templates. 

The atomic model of the B. subtilis 50S subunit was 
docked into the 7.6 A mature 50S density map, optimized 
by the means of Molecular Dynamics Flexible Fitting (30). 
Atomic models of the structures of the 45S particles were 
similarly obtained. The temperature maps of the prema- 
ture 45S structures were constructed using an extension 
module of PyMOL (31). The scripts (rmsd_b.py and 
color_b.py) were downloaded from http ://pld server 1. 
biochem.queensu.ca/~rlc/work/pymol/. Three-dimen- 
sional distance deviations of the 23S rRNA atoms in the 
45S models, relative to the mature 50S model, were 
computed and stored in the B-factor column. The 
ColoRNA (32) was used to draw different colors to 2D 
secondary structure diagrams of the 23S rRNA based on 
values of the B-factor column. PyMOL and Chimera (33) 
were used for structural visualization. 

RESULTS 

Compositional analysis of the 45S particles from the 
YlqF-deficient strain 

We isolated the premature 45S particles from a previously 
established B. subtilis YM01 strain (18), in which the 
genomic ylqF gene was conditionally inactivated by an 



IPTG-inducible spac promoter. Tricine-sodium dodecyl 
sulphate-polyacrylamide gel electrophoresis analysis of 
the 45S particles and mature 50S subunits dissociated 
from the 70S ribosomes confirmed that at least L16 and 
a few other proteins are apparently missing from the 
45S particles (Figure 1A). To characterize possible 
underrepresented proteins that are beyond the polyacryl- 
amide gel electrophoresis resolution, an isobaric tag-based 
mass spectrometry (34) was used to determine the 
occupancies of ribosomal proteins in the 45S particles 
relative to the mature 50S subunits. 

As a result, QMS data show a specific pattern, with a 
dozen of the 50S proteins underrepresented at varying 
levels in the 45S sample (Supplementary Table SI). In 
contrast, the rest 20 proteins are comparable in both 
samples. This distinctive protein spectrum indicates that 
the 45S particles from the YlqF-deficient strain are a 
specific set of assembly intermediates, and secondary 
effect from the impaired ribosomal protein supply is 
minimal. Among underrepresented proteins, L28, L16, 
L33, L36 and L35 were detected to be dramatically 
reduced in the 45S particles (Figure IB). Especially, L28 
and L16 are virtually absent (~10%) from the 45S par- 
ticles, considering the inevitable contamination of free 50S 
subunits during the 45S sample preparation 
(Supplementary Figure SI). In addition, proteins L13, 
LI 5, L31 and L27 show moderately reduced levels, 
ranging from 36% to 45%. Structural mapping of these 
proteins indicates that most of them are in the neck region 
of the CP (Supplementary Figure S3). In fact, all of the CP 
proteins are underrepresented, except two 5S rRNA- 
binding proteins L5 and LI 8. These results agree well 
with early in vitro data (5,35) showing that these 
underrepresented proteins are at the branch ends of the 
in vitro 50S assembly map. Also, taking into consideration 
of the previous data that the 5S rRNA, L5 and L18 in- 
corporate as a preformed ribonucleoprotein complex at 
the beginning stage of the CP assembly (5,36,37), it dem- 
onstrates that these 45S intermediates represent a late 
stage of the 50S assembly after the incorporation of the 
5S rRNA. 

In addition, we also found that two proteins, Yqel and 
YqeL, are enriched in the 45S sample (Figure IB). The E. 
coli homologs of them (YhbY and YbeB, respectively) 
were also reported to cofractionate with the large 
subunit fractions (34,38). This indicates that these two 
proteins are likely involved in the ribosome-related 
function as well. 

Image analysis of the 45S particles from the 
YlqF-deficient strain 

Next, we used cryo-EM to probe the structural viability of 
the 45S particles. At the 2D level, unsupervised classifica- 
tion of particles results in nicely resolved class averages 
(Supplementary Figure SI). A prominent feature of these 
average images is that all of them display a stable body 
domain with discernible fine details, but most of them also 
exhibit large flexibility at the CP, LI and L7/L12 stalks. 
In some cases, densities at these regions are nearly 
smeared out. 
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Figure 1. Compositional analysis of the 45S particles by QMS. 
(A) Protein gel analysis of the mature 50S (Mature) and premature 
45S samples (YlqF~). The ribosomal protein bands that are missing 
or weak in the 45S sample are labeled with asterisks. (B) QMS 
analysis of the occupancies of the 50S proteins in the 45S sample. 
Standard deviations are shown by error bars. 



It is known that the two peripheral stalks of the 50S 
subunit are intrinsically dynamic. However, the proper 
function of the ribosome requires the rigidity of the CP, 
as it not only participates in the intersubunit association, 
but also directly interacts with the D-loops of the A-site 
and P-site tRNAs to form the essential tRNA corridor on 
the 70S ribosome (24). Therefore, from the 2D analysis 
alone, it is likely that the 45S particles are functionally 
defective and unable to associate with the 30S subunit. 

Cryo-EM structures of the 45S assembly intermediates 

At the 3D level, with extensive tuning of classification 
parameters, we were able to sort the heterogeneous 
45S particles into several well-defined groups, and 
obtain a series of structures at near 1 nm resolution 
(Supplementary Table S2). Classification of two independ- 
ent biological replicates of the 45S samples produced simi- 
lar results (Supplementary Table S2 and Supplementary 
Figure S2). On these maps, many secondary structural 
elements of the body domain can be easily identified, 
demonstrating the validity of the classification 
(Figure 2). Based on the map features, these intermediate 
maps were grouped into two major conformational states 
(I and II) (Figure 2, Supplementary Figure S2 and 
Supplementary Table S2). The first state (I-a), accounting 



for nearly one-third of particles, is in a conformation gen- 
erally similar to the mature 50S subunit (Figure 2B and E). 
In contrast, the second state (Il-a and Il-b), derived from 
~70% of particles, displays a floppy CP and a well- 
resolved body domain (Figure 2C and F). When the Il-a 
map is displayed at normal threshold, the whole CP 
domain is invisible (Figure 2F), indicating that the CP in 
the 45S particles is highly dynamic. At relatively low 
threshold, bulk densities of the CP begin to show up 
and appear to have flipped toward the LI stalk direction 
(Figure 21, Supplementary Figure S2). 

Highly consistent with the QMS data, analysis of the 
two states of the 45S particles reveals that a number of 
proteins are evidently missing from the density maps 
(Figure 2B and C and Supplementary Movie S 1 S3). 
Most of these missing proteins bind at multiple helical 
junctions, underlining their contribution to the stabiliza- 
tion of the 23S rRNA conformation. Therefore, our struc- 
tural and compositional analysis suggests that the 
incorporation of the CP-binding proteins is rate-limiting 
in the late-stage maturation of the 50S subunit, and the 
tertiary interactions of these proteins with the 23S rRNA 
are likely crucial for the maturation of the 23 S rRNA at 
the CP. 

Functional centers of the 45S particles are highly flexible 

In addition to the floppy CP domain, these assembly inter- 
mediates also differ at varying extents in other regions. 
Especially, functional centers of the 50S subunit, as well 
as rRNA helices involved in tRNA binding, deviate 
largely from the structure of the mature 50S subunit. 

Firstly, the peptidyl-transferase center (PTC), which 
spans RNA helices 89-93, shows significant conform- 
ational differences among these structures. This functional 
center is the most important enzymatic center of the 
ribosome, which interacts with the CCA-ends of the A- 
site and P-site tRNAs, and directly participates in the 
peptide elongation and release reactions (39). None of 
the 45S maps exhibits completely resolved PTC, and espe- 
cially, H89 appears to be extremely dynamic (Figure 3A). 

Secondly, the helices 67-71 of the 23S rRNA are also 
highly unstable. The long rRNA stem H68 is invisible in 
all the structures of the 45S particles (Figure 2 and 
Supplementary Figure S2). H69 is highly flexible in both 
conformational states (Figures 2 and 3), and appears to be 
more bent toward the 30S direction in the maps of the 
state II (Figure 3B). Notably, H69 is an essential compo- 
nent of the intersubunit bridge (B2a) and also directly 
involved in coordinating tRNA movement during transla- 
tion (40). 

Thirdly, a significant difference lies in the position of 
H38 of the 23S rRNA. H38 is also called the A-site finger, 
which interacts with the A-site tRNA during translation, 
and has been demonstrated to have a number of func- 
tional implications (41^13). The terminal loop of H38 
also interacts with the 30S subunit to form an intersubunit 
bridge Bla (24). In the I-a map of the state I, H38 is in a 
native-like conformation, with its tip bent toward the 
body domain by ~15A (Figure 4A-E). In contrast, in 
the structures of the state II, H38 is seen to have a 
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Mature l-a I l-a 

Figure 2. Structural analysis of the 45S particles. (A) Surface representation of the cryo-EM map of the mature 50S subunit. (B-C) Surface 
representation of the 45S maps in the I-a (B) and Il-a (C) states, superimposed with their missing components. (D-F) Surface representation of 
the cryo-EM maps of the mature 50S subunit (D), I-a (E) and Il-a (F) states of the 45S particles, with their respective flexibly fitted atomic models 
superimposed. For simplification, only I-a and Il-a states are shown. (G-I) Representative class averages of the cryo-EM raw images for the three 
conformational states (Mature, I-a and Il-a), produced by a reference-free classification. 



massive displacement, rotated ~17° toward the L7/L12 
stalk direction, forming a non-native interaction with 
Lll (Figure 4F-J). Notably, this 'trapped' conformation 
of H38 in the state II totally blocks the binding site of L16 
(Figure 41), which readily explains why the LI 6 level is 
extremely low in the 45S particles. 

Taking together, our structural analyses show that the 
functional centers of the 45S particles are not correctly 
assembled, indicating that these 45S particles are defective 
not only in intersubunit association but also in tRNA 
binding. 

Domains IV and V of the 23S rRNA are not correctly 
assembled in the 45S particles 

To understand the 23 S rRNA folding in a global scale, we 
computed temperature factors for the 23S rRNAs in these 
45S structures according to their deviations from the 
structure of the mature 50S subunit (Figure 5). The tem- 
perature factors were mapped to both the 3D structures 
(Figure 5 and Supplementary Figure S4) and 2D second- 
ary maps (Supplementary Figure S5). 

As a result, a clear pattern starts to appear. Domains IV 
and V of the 23S rRNA are the most affected regions. The 
3' half of Domain IV consists of three helices, H68, H69 
and H71, and all of them are highly dynamic (Figure 5C 
and 5F, Supplementary Figure S4). In contrast, the 5' half 



of Domain IV, which forms part of the body domain, 
remains to be relatively stable (Figure 5C and F). 
Domain V accounts for three major functional compo- 
nents of the 50S subunit, the LI stalk (H76-78), the 
PTC (H89-93) and the CP (H80-88). All these 
three parts display large conformational differences 
(Figure 5D and 5G, Supplementary Figure S4), especially 
in the conformational state II. The LI stalk is visible in all 
the structures, but in different positions (Figure 5 and 
Supplementary Figure S4). The other domains of the 
23S rRNA (I, II, III and VI) appear to be relatively 
stable, except that H38 deviates from its mature conform- 
ation, and the L7/L12 stalk base (H42^14) also shows 
some structural viability (Figure 5B and E). These tem- 
perature maps show that all the helices involved in the 
formation of tRNA corridor on the 50S subunit are 
highly flexible. 

On the 2D secondary maps (Supplementary Figure S5), 
it is clear that the 23S rRNA does not mature in a lin- 
ear manner, and both the 5' and 3' domains of the 23S 
rRNA in the 45S structures are already correctly 
assembled. The maturation of the 23S rRNA at Domain 
IV and V appears to be the slowest step. Together with our 
QMS data, this indicates that the assembly of the CP is 
likely rate-limiting, and requires the assistance from 
assembly factors. 
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Figure 3. Conformational variability of the PTC and H69 regions in 
the 45S structures. (A) Thumbnail of the mature 50S subunit viewed 
from the intersubunit side. (B) Close-up view of the PTC. H80, H89, 
H90, H91, H92 and H93 are colored yellow, blue, red, magenta, green 
and purple, respectively. All the maps in (C-F) are superimposed with 
their respective atomic models. As shown, none of the intermediate 
maps (I-a, Il-a and Il-b) display a completely resolved PTC, indicating 
a large flexibility at these helices. (G) Thumbnail of the mature 50S 
subunit viewed from the L7/L12 side. (H) Close-up view of the H69 
region. H67, H68, H69, H70, H71 and the PTC are colored red, green, 
purple, blue, yellow and black, respectively. All the maps in (I-L) are 
superimposed with the atomic model of the mature 50S structure. 

DISCUSSION 

Reorientation of H38 is a major rate-limiting step of the 
late-stage 50S subunit assembly 

In the present work, we characterized a set of in vivo 45S 
assembly intermediates, which reveals two major conform- 
ational populations. These two states mainly differ in the 
stability of the CP and the orientation of H38. 
Comparison of the two states indicates that the reorienta- 
tion of H38 to its native-like position leads to a global 
stabilization of the rRNA structure at the CP (Figure 2E 
and F). Apparently, this stabilization of the CP in the state 
I is a direct consequence of the interaction of the 5S rRNA 
with H38, considering the topology of them in the 50S 
structure (Supplementary Figure S3). This suggests that 
the 5S rRNA serves as an important bridge in the 
mutual stabilization between the CP rRNA components 
and H38. Consistently, both the omission of the 5S rRNA 
(44) from the in vitro reconstitution and the depletion of 
the 5S rRNA binding protein L5 in vivo (36) result in 
significantly reduced incorporation of the CP proteins 
into the 50S assembly intermediates. Protein compositions 
of the two states are similar, both severally lacking the 
CP-binding proteins (Figure 2), except that L30 is 
present in the I-a state (Supplementary Figure S6, 
Supplementary Table S2 and Supplementary Movie SI). 



L30 binds exactly at the interface between the 5S rRNA 
and the stem base of H38 (Supplementary Figures S3 and 
S6), suggesting that the binding of L30 might partially 
contribute to the reorientation of H38. 

The ratios of the 45S particles in the state I to II 
are 1:2.6 and 1:4.1, for the two biological replicates of 
the 45S samples, respectively (Supplementary Table S2), 
indicating that most of the 45S intermediates are trapped 
in the state II. This suggests that the reorientation of H38 
(from states II to I) is likely rate-limiting. Our structural 
data show that LI 6 is incompatible with H38 in the state 
II, implying that the binding of LI 6 would require the 
breaking of existing non-native interaction between H38 
and the L7/L12 stalk base and the reestablishing the native 
interaction between H38 and the 5S rRNA (Figure 4). 
Consistently, LI 6 is exactly a common protein detected 
to be missing from various forms of premature particles 
(45^19) (Supplementary Table S3). Along this line, early 
in vitro reconstitution experiments showed that the incorp- 
oration of LI 6 into the late-stage particles requires a large 
activation energy and is temperature dependent (50). 
Therefore, altogether, it indicates that the native conform- 
ation of H38 is probably not established in these different 
assembly intermediates, suggesting that the reorientation 
of H38 is a global rate-limiting step in the late-stage 
assembly pathways. In support of this view, it was 
already found that mutations on H38 lead to a defect in 
the 50S subunit assembly in E. coli (43). 

Furthermore, we noticed that H38 in the state I is in a 
near-native conformation, still partially overlapping with 
the L16 binding sites (Figure 4D). There are 28% and 
20% of 45S particles trapped in the state I, for the batch 
1 and batch 2 data, respectively (Supplementary Table 
S2), indicating the presence of unidentified rate-limiting 
steps after the 45S particles acquire the state I conform- 
ation. This suggests that H38 might undergo a series of 
structural remodeling during the late-stage assembly, and 
each step could be coupled to certain protein binding 
events. 

It is known that H38 is also dynamic on the 70S 
ribosome, as required for the reorganization of 
intersubunit bridge Bla during the ratchet-like motion 
(51,52). This dynamic behavior is enabled by a unique 
kink-turn motif (53) at the base of H38 (54). However, 
its conformational change on the 70S ribosome is in a 
much smaller scale, compared with its orientations in 
our 45S structures. Taking together with published data 
on the function of H38 in translation (41-43,55), it 
suggests that H38, as an intrinsic flexible element, plays 
roles in both translation and ribosome assembly. 

The late-stage assembly of the 50S subunit in vivo 

Protein composition of the 45S particles shows that L28, 
L16, L33, L36 and L35 are in significantly reduced levels. 
This pattern of underrepresented proteins shows interest- 
ing similarities to the protein spectra of the in vivo 40S or 
45S particles isolated from various E. coli or B. subtilis 
strains with genes for different assembly cofactors dis- 
rupted, including csdA, srmB, rrmJ, clnaK, dbpA, engA, 
yphC, ysxC and obgE (17,45^19,56,57). Among these 
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Figure 4. Conformational variability of H38 of the 23S rRNA in the 45S intermediates. Structural comparison of H38 in the I-a (B-E) or Il-a (G-J) 
structures, with that of the mature 50S structure. Two different thumbnails of the mature 50S subunit (A and F), with a few components highlighted 
in different colors, are shown in left panels. (B, C, G and H) Close-up view of the mature 50S (B and G), the I-a (C) and Il-a (H) structures in the 
same area as in their respective thumbnails, with their atomic models superimposed. (D and I) The I-a (D) and Il-a (I) structures are superimposed 
with the atomic model of the mature 50S structure. (E and J) Conformational change of H38 between the mature and the I-a (E) or Il-a (J) state. 
The 5S rRNA, H38, H89 and H42-44 are colored blue, red, green and yellow, respectively. Ribosomal proteins, L16, L36, L30 and Lll are colored 
black, golden, purple and dark green, respectively. For clarification, the 23S rRNA helices colored gray in the upper left thumbnail (A) are not 
shown in (B-E). 



Domain l-III.VI Domain IV Domain V 




Figure 5. Temperature maps of the 23S rRNA in the 45S intermediates. (A) Cartoon representation of the 23S rRNA, with its domains colored 
separately. (B-G) Temperature maps of the domains of the 23S rRNA, colored according to their deviations from the conformation of the mature 
50S subunit. For simplification, only I-a (B-D) and Il-a (E-G) states are shown. Individual rRNA helices are labeled. The rRNA helices that are not 
well resolved in cryo-EM maps are colored black (highly flexible). CP, central protuberance; L7/L12 SB, L7/L12 stalk base. 



factors, CsdA, DbpA and SrmB are DEAD-box RNA 
helicases. DnaK and RrmJ are heat-shock proteins with 
specialized functions, as general protein chaperone and 
23S rRNA methyltransferase, respectively. And ObgE, 
EngA, YphC and YsxC are all GTPases involved in 
ribosome biogenesis. These different sets of in vivo 
assembly intermediates, obtained by perturbation of pre- 
sumably distinctive assembly events, however, share a 
similar deficiency in LI 6, L27, L33, L34, L35 and L36 



(Supplementary Table S3). In addition to the perturbation 
of assembly factors, deletion of certain ribosomal protein 
genes, such as L5, L27 and L28, also induces the accu- 
mulation of in vivo precursors lacking these proteins 
(36,58-60). Therefore, it indicates that the assembly of 
these late-stage proteins is likely bottlenecks in vivo and 
susceptible to various forms of disruption. 

However, with a wealth of the above semiquantitative 
data, it remains impractical to determine the exact binding 
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Figure 6. A major branch of the late-stage assembly of the bacterial 
50S subunit in vivo. (A) Cartoon representation of the 45S intermediate 
in the state II, featuring a loosely attached 5S rRNA, a flexible CP 
domain (dash lines) and a 'trapped' conformation of H38 (yellow lines). 
(B) Cartoon representation of the 45S intermediate in the state I, where 
the rigidity of the CP is achieved and the interaction between the 5S 
rRNA and H38 is established. H38 is in a native-like conformation in 
state I. (C) Cartoon representation of the mature 50S structure. There 
are two major rate-limiting transitions in this assembly branch, the 
state II to I, and the state I to the mature conformation. Both transi- 
tions involve the structural remodeling of H38. 

order of these late-stage proteins because these data only 
reflect the compositions of various kinetically trapped 
in vivo intermediates in equilibrium state. The Nierhaus 
map, on the other hand, did not explicitly test all 
possible binding dependences among these late-binding 
proteins. Recently, with the time-resolved pulse-labeling 
coupled QMS technique, Williamson lab quantified the 
ribosomal protein levels throughout the whole sucrose 
gradient of the cell lysate from fast-growing wild-type 
E. coli cells, and successfully assigned the 50S proteins in 
six temporal groups (61). All the proteins with highly 
underrepresented levels in our 45S sample (L28, L16, 
L33, L36 and L35) are in the latest group, except that 
L28 and L33 are in the fifth group. Through the integra- 
tion of all available data, we could isolate a major 
pathway for the late-stage assembly of the 50S subunit 
in vivo, with a focus on the CP-binding proteins only 
(L16, L27, L30, L31, L33, L35 and L36). As shown, 
there are two major assembly intermediates (state I and 
II) in this assembly branch (Figure 6). The first transition 
from the state II to I is facilitated by L30, which places 
L30 at a relatively earlier stage. This assignment is con- 
sistent with the Nierhaus map and the recent time-resolved 
data (61). It is difficult to unambiguously further time 
stamp the following CP proteins in the second transition 
(from state I to mature) owing to the lack of kinetic data 
with sufficient temporal resolution. Both transitions are 
rate-limiting because our structural data show that the 
45S particles are trapped in both the state I and II. It 
must be noted that the second transition should consist 
of multiple steps, of which the crucial rate-limiting ones 
remain to be investigated. 

In terms of the 23S rRNA maturation, our results show 
that it does not proceed in a 5'-3' transcriptional order, 
with the central domains IV and V being the last. Notably, 
this maturation order was also predicted by the above- 
mentioned pulse labeling QMS study (61). According to 
their results, the CP proteins in fact constitute the latest 
assembly group (61). Interestingly, this non-linear matur- 
ation of the 23S rRNA is in sharp contrast to the assembly 
of the 16S rRNA. Previous in vitro studies indicate that 
the 16S rRNA matures roughly in a 5—3' order (62,63), 
with the 3' head domain at last, manifesting a 



co-transcriptional nature for the 30S subunit assembly 
in vivo (64). Despite this difference, the co-occurrence of 
the hypolevels of ribosomal proteins and structural 
instability of the rRNAs, observed at the CP of the 45S 
particles, was also found at the 3' domain of the 16S 
rRNA during the late-stage assembly of the 30S subunit. 
Therefore, similar to what is known for the assembly of the 
3' domain of the 30S subunit (62,65-67), there must also be 
a high degree of cooperativity between the protein binding 
and RNA remodeling at the CP of the 50S subunit. 

Molecular role of YlqF as an rRNA chaperone 

Ribosomal subunits are gigantic nucleoprotein complexes, 
and their structures are rich in RNA-RNA and RNA 
protein tertiary interactions. The energy landscape of the 
assembly of the ribosomal subunits is therefore inevitably 
decorated with various kinetic traps, corresponding to 
certain mis-assembled rRNA or slow protein-binding 
events. Assembly factors, as well as some ribosomal 
proteins, are believed to act as RNA chaperones to 
prevent or rescue certain non-native rRNA intermediates 
(62,68). 

Previously, based primarily on the protein composition 
in the 45S particles from YlqF-deficient cells (15,18,69), 
the role of YlqF was emphasized in the assembly of LI 6, 
L27 and L36. However, these proteins were commonly 
detected to be missing from various in vivo assembly inter- 
mediates (Supplementary Table S3). This means that the 
most direct role of YlqF is probably not in protein 
assembly. In the present work, we found that H38 is in 
an abnormal trapped conformation in the structures of 
conformational state II, and the reorientation of H38 
appears to be a prerequisite for downstream assembly 
events. This suggests that YlqF likely facilitates the struc- 
tural remodeling that contributes to the interaction of H38 
with the 5S rRNA. Consistent with this view, the putative 
binding sites of YlqF, mapped by dimethyl sulfate foot- 
printing, are located to the CP (H81 and H85) and H38 
(16). Therefore, YlqF might act as a chaperone to lower 
the energy barrier required for the reorientation of H38. In 
further support of this proposed role, the low intrinsic 
GTPase activity of YlqF could be simulated by both the 
mature 50S and premature 45S subunits (16,18,69). 
However, the mature 50S subunit has a more marked 
effect on the GTPase activation of YlqF (69). Thus, 
combining all these data, sequential actions of YlqF 
could be reasoned as follows. YlqF in GTP-bound state 
first binds to the 45S particle and facilitates the reorienta- 
tion of H38. Once H38 achieves its native-like conform- 
ation, the mutual stabilization between the CP and H38 in 
turn accelerates the further assembly of the CP, allowing 
an efficient incorporation of the CP proteins. The fully or 
nearly matured 50S subunit simulates the GTPase activity 
of YlqF, and YlqF finally releases in GDP-bound state 
upon GTP hydrolysis. 

Intrinsic quality control of the ribosomal subunit 
production 

Translation is such a fundamental process for all cellular 
activities that the quality of the assembly intermediates 
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along the assembly pathways has to be closely monitored. 
Some assembly GTPases are believed to act as checkpoint 
proteins at expense of GTP molecules, e.g. to ensure a 
certain intermediate conformation or assembly event (9). 
Although our data is more consistent with an rRNA chap- 
erone role for YlqF, it is possible that an unidentified 
assembly event in the final stages of the 50S assembly, 
which is responsible for triggering the GTP hydrolysis 
on YlqF, could be used as a structural checkpoint. In 
this alternative mechanism, YlqF might associate with 
the 45S particle and act as a physical blocker to down- 
stream assembly events until a certain crucial step is 
finished. This interpretation provides a different view on 
the role of YlqF, with an emphasis in the quality control 
of the 50S subunit assembly. 

Another interesting finding is that the 45S particles, 
whether in the state I or II, display large flexibility in 
intersubunit bridging and tRNA contacting sites. This 
observation shows intriguing similarity with previous 
structural information on the premature 30S subunit pre- 
cursors. Early chemical or hydroxyl radical probing of the 
16S rRNA conformation (63,65-67,70), as well as direct 
visualization of the 30S precursors by cryo-EM (20,71) all 
indicate that the 3' domain of the 16S rRNA remains 
largely immature until a very late stage. Especially, the 
3' minor domain of the 16S rRNA, which includes an 
essential long helix (h44), is not correctly assembled in 
the assembly intermediates. Notably, h44 is a functional 
component of the decoding center on the 30S subunit, and 
is also indispensable for both subunit association and 
tRNA interaction. In combination, these data reveal 
that both the 50S and 30S subunits in premature 
forms are functionally defective. This could have a physio- 
logical relevance, as it might demonstrate an additional 
level of quality control of protein translation, by prevent- 
ing the escape of premature subunits into the translation 
pool. 
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